Summarizing Web Sites Automatically
Abstract
This research is directed towards automating the Web site summarization task. To achieve this objective, we employ an approach that applies machine learning and natural language processing techniques. The automatically generated summaries are compared to manually constructed summaries from the DMOZ Open Directory Project. The comparison is performed via a formal evaluation process involving human subjects. Statistical evaluation of the results demonstrates that the automatically generated summaries are as informative as human-authored DMOZ summaries and significantly more informative than home page browsing or time-limited site browsing.
Related resources
Adaptive Sites: Automatically Learning from User Access Patterns
Designing a web site is a complex problem. Logs of user accesses to a site provide an opportunity to observe users interacting with that site and make improvements to the site’s structure and presentation. We propose adaptive sites: web sites that improve themselves by learning from user access patterns. Adaptive sites can make popular pages more accessible, highlight interesting links, connect ...
Extracting and Summarizing Hot Item Features Across Different Auction Web Sites
Online auction Web sites are fast changing and highly dynamic. It is difficult to digest the poorly organized and vast amount of information contained in the auction sites. We develop a unified framework aiming at automatically extracting the product features and summarizing the hot item features across different auction Web sites. One challenge of this problem is to extract useful information ...
Web Database Integration
More and more databases are accessible on the Web. In order to provide people with unified access to these Web databases and to retrieve information from them automatically, a comprehensive solution for Web database integration is proposed in this paper. After summarizing the research status in this area, the works which are the focus of my PhD thesis are presented.
Semantic Summarization Of Web Documents
Document summarization techniques automatically extract information from different sources. The main purpose of this paper is to summarize documents retrieved from the Internet. The proposal is to capture a document from the Internet, store it in a database, extract it, and use natural language processing in order to retrieve similar information. An overview of the system and some prelim...
Term-Based Clustering and Summarization of Web Page Collections
Effectively summarizing Web page collections becomes more and more critical as the amount of information continues to grow on the World Wide Web. A concise and meaningful summary of a Web page collection, which is generated automatically, can help Web users understand the essential topics and main contents covered in the collection quickly without spending much browsing time. However, automatic...